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(54) Transfer of guard values In a computer system 



(57) A method of transferring guard values and a 
computer system utilising the method are described. In 
a computer system using predicated instruction execu- 
tion, predicated on the values of particular guard indica- 
tors, a master set of guard indicators is held. The com- 
puter system has a parallel set of execution units, and 
the master set is held in association with one of the ex- 
ecution units. If other execution units require the guard 



values for particular guard indicators, a sendguard in- 
struction is issued to the execution unit holding the mas- 
ter guard values. The sendguard instructions are held 
in a separate queue from the main instructions intended 
for that execution unit. Moreover, circuitry is provided in 
the execution unit to avoid stalling in the despatch of 
sendguard instructions even in the context of earlier 
guard modifying instructions. 
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Description 

[0001] The present invention relates to a method of 
transferring guard values and to a computer system uti- 
lising the method. 

[0002] Computer systems can operate in which in- 
structions include a guard indicator sothat the execution 
or otherwise of the instruction is dependent upon the val- 
ue of the guard indicator. Problems do however arise 
when a plurality of instructions are executed simultane- 
ously in a plurality of parallel execution units. This may 
arise in computer systems operating in superscalar or 
very long instruction word (VLIW) instruction modes so 
that the execution of some instructions may affect the 
guard values to be associated with other instructions in 
parallel execution units. Problems may also arise from 
pipelined execution of instructions so that the execution 
of some later instructions commences prior to the com- 
pletion of execution of earlier instructions in the pipeline. 
[0003] In a computer system which requires the trans- 
fer of guard values between parallel execution units, it 
is important that the transfer of guard values is accom- 
plished as promptly and efficiently as possible. This 
however should be done without transferring incorrect 
guard values and, preferably, without stalling the oper- 
ation of the pipelined execution units. 
[0004] Guarded instruction execution can be referred 
to as predicated instruction execution. 
[0005] According to one aspect of the present inven- 
tion there is provided a computer system for executing 
instructions having assigned guard indicators, which 
system comprises instruction supply circuitry, a plurality 
of parallel execution units for receiving respective in- 
structions from the supply circuitry, each instruction hav- 
ing a respective guard indicator selected from a set of 
guard indicators, common to the plurality of execution 
units, one of said execution units including a master 
guard value store containing a master representation of 
current values for the guard indicators in said set of 
guard indicators and guard vaiu e transfer circuitry oper- 
able to transfer a guard value from said master store to 
another of said execution units in response to a send- 
guard instruction being executed in said one execution 
unit, wherein the instruction supply circuitry comprises 
a main instruction queue for holding instructions to be 
supplied to the parallel execution units and a subsidiary 
instruction queue for holding sendguard instructions, 
with the subsidiary queue having priority access to the 
execution pipelines to avoid unnecessary delays for ex- 
ecution of sendguard instructions. 
[0006] According to another aspect of the present in- 
vention there is provided a method of executing instruc- 
tions in a computer system, said instructions having as- 
signed guard indicators, which method comprises sup- 
plying a plurality of instructions to parallel execution 
units, each instruction having a respective guard indica- 
tor selected from a set of guard indicators common to 
the plurality of execution units, holding a master set of 



current values for the guard indicators in one execution 
unit, effecting a transfer of the master guard value from 
said one execution unit in response to a sendguard in- 
struction being executed in said one execution unit 

5 wherein instructions to be supplied to the parallel exe- 
cution unit, save for said sendguard instructions, are 
held in a main instruction queue and wherein said send- 
guard instructions are held in a subsidiary queue having 
priority access to the execution pipelines to avoid un- 

10 necessary delays for execution of sendguard instruc- 
tions. 

[0007] Thus there is a separate queue for sendguard 
instructions which has priority access to the execution 
pipeline. If this is done in the context of using guard val- 

is ue modifying instructions which can modify guard val- 
ues and which are held in the main instruction queue, it 
is important to check before each sendguard instruction 
is supplied to the execution pipeline that no earlier guard 
value modifying instructions affecting the guard value 

20 requested by the sendguard instruction is still waiting in 
the main instruction queue. 

[0008] According to a further aspect of the invention 
there is provided a pipelined execution unit for a com- 
puter system which comprises at least two pipelined 

25 stages, with an earlier one of the pipelined stages in- 
cluding sendguard circuitry responsive to a sendguard 
instruction to despatch a guard value lor a guard indi- 
cator defined in the sendguard instruction, and a later 
one of the pipelined stages including guard value mod- 

30 jfyjng circuitry for executing guard value modifying in- 
structions which cause the value of a guard indicator to 
be modified, the pipelined execution unit further com- 
prising: a master guard value store for holding a master 
representation of current values for guard indicators; 

36 and means for determining any dependencies between 
a sendguard instruction in the earlier pipelined stage 
and a guard modifying instruction in the later pipelined 
stage; and switching circuitry for selecting when a send 
guard instruction is executed and responsive to any 

40 such dependencies, whether a guard value held in the 
master guard value store or a guard value just modified 
by the guard value modifying circuitry is to be des- 
patched. 

[0009] According to a further aspect of the invention 
45 there is provided a method of executing instructions in 
a pipelined execution unit, each instruction having a re- 
spective guard indicator selected from a set of guard in- 
dicators and execution of the instructions being predi- 
cated on the values of the guard indicators, wherein the 
so instructions include a sendguard instruction which, 
when executed, causes the transfer of a guard value 
from the pipelined execution unit and a guard value 
modifying instruction which modifies the value of a 
guard indicator, the method comprising: supplying in- 
55 structions to the pipelined execution unit including said 
sendguard instructions and said guard value modifying 
instructions; checking dependencies between guard 
value modifying instructions supplied to the pipelined 
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execution unit earlier than a sendguard instruction re- 
lating to the same guard indicator; and supplying the 
guard value of the guard indicator requested in th e send- 
guard instruction selectively from a master guard value 
store or guard value modifying circuitry in dependence 
on the results of said dependency checks, so as to en- 
sure that the guard value of a guard indicator which is 
despatched responsive to a sendguard instruction is 
correct in relation to any earlier guard value modifying 
instructions in the pipelined execution unit. 
[0010] According to the embodiment described here- 
in, each execution unit comprises a plurality of pipelined 
stages with synchronised pipeline cycles for each of the 
execution units. Circuitry is provided to allow for prompt 
despatch of a guard value requested by a sendguard 
microinstruction, regardless of its dependency on earlier 
guard value modifying instructions in the pipeline, and 
without stalling the pipeline. 

[0011] For a better understanding of the present in- 
vention and to show how the same may be carried into 
effect, reference will now be made by way of example 
to the accompanying drawings in which: 

Figure 1 shows a computer system in accordance 
with one embodiment of the invention; 
Figure 2 shows the layout of the instruction queues; 
Figure 3 is a diagram illustrating circuitry in the pipe- 
line to avoid stalling when executing sendguard in- 
structions; and 

Figure 4 is a partial diagram of a microinstruction 
format. 

[0012] The computer system of this example is ar- 
ranged for the parallel execution of a plurality of instruc- 
tions and is particularly suited to providing a high digital 
signal processing (DSP) performance. Instructions are 
held in a program memory 11 and after passing through 
a control unit 12 are supplied to four parallel execution 
pipelines 13,14,15 and 16. Pipelines 13 and 14 are 
shown as slot 0 and slot 1 of a data unit arranged to 
execute instructions carrying arithmetic operations. 
Pipelines 15 and 16 are shown as slot 0 and slot 1 of an 
address unit 19 used to execute instructions for memory 
accesses to a data memory 20. Slot 1 or Slot 0 of the 
address unit 1 9 may also be used to supply instructions 
to a general unit 21 which shares some resources with 
the address unit 1 9. The general unit 21 includes a con- 
trol register file 22 as well as branch circuitry which is 
used to provide instruction branch information on line 
23 to the control unit 12. 

[001 3] The two pipelines 1 3 and 1 4 in the data unit 1 8 
share a common data register file 26 and a common 
guard register file 27 holding the master guard values 
which may be associated with the instructions. The two 
pipelines also derive instructions from a common in- 
struction dispatch queue (D-IDQ) 29 in the data unit 18 
and instructions in the queue 29 are checked for data 
dependency by common vertical dependency check cir- 



cuitry 30 in the data unit 1 8. The sequence of operations 
in each of the pipeline stages in the data unit 18 include 
an operand fetch usually accessing one of the register 
files 26 or 27 followed by two execution stages which 

5 may use arithmetic circuitry 33 followed by a data write 
stage at which the result of an arithmetic operation is 
returned to the register file 26 and 27. A similar pipeline 
exists for both pipelines 13 and 14 on the data unit 18. 
[0014] Similarly for the address unit 19, both pipelines 

10 15 and 16 access a common register file 40 holding 
pointer values for use in load or store operations in the 
data memory 20. The two pipelines each take their in- 
structions from a common instruction dispatch queue 
(A-IDQ) 41 and a similar vertical dependency check is 

is provided in common for both pipelines 15 and 16 in the 
address unit 19. The vertical dependency check is sim- 
ilar to that already referred to in the data unit 18. In ex- 
ecuting instructions through the two pipelines 1 5 and 1 6 : 
accesses will be made to the register file 40. Add and 

20 subtract units 43 may be used in execution of the in- 
structions. Each of the pipelines 15 and 16 in the ad- 
dress unit 19 includes an operand fetch followed by an 
execution stage and an address write back stage. 
[0015] Both the data unit 18 and the address unit 19 

2S are connected to the data memory 20 through a data 
memory interface control 50. The control 50 is connect- 
ed by buses 53 to both the data unit 1 8 and the address 
unit 19. The interlace control 50 includes a plurality of 
queues each connected to the buses 53. These queues 

30 include load data queues (LDQ) 60 for data awaiting 
loading into register files of the data unit 18 or address 
unit 1 9. The interface control 50 also includes store data 
queues (SDQ) 70 for data awaiting storage in the mem- 
ory 20. Store address queues (SAQ) 71 are provided to 

35 indicate the locations in the memory 20 at which the data 
is to be stored. 

[001 6] It will be understood that when instructions are 
executed to load data from the data memory 20 into the 
data register files of the data unit 18, the address unit 

40 1 9 will access the data memory 20 and load the required 
data into the load data queues 60 prior to completing 
the update of the data register file 26 by transferring the 
data from the appropriate queue 60. Similarly when in- 
structions are executed to store data from the data unit 

45 18 into the data memory 20 the appropriate data may 
be held in the store data queues 70 together with the 
store addresses in queue 71 prior to completing the 
store operation in the data memory 20. 
[0017] By executing the memory addressing instruc- 

50 tion in the address unit 1 9 in separate parallel pipelines 
from those provided in the data unit 18, the computer 
system operates access decoupling in that the memory 
accesses are effected independently of the arithmetic 
operations carried out within the data unit 18. This re- 

55 duces the problem of memory latency. In a digital signal 
processing system which operates regular and repeat- 
ed operations the memory latency can be hidden from 
the executing program. 
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[001 8] In the above description, all instructions which 
are fed through pipelines 13, 14, 15 and 16 are subject 
to a vertical dependency check and if any data depend- 
ency is found which cannot be resolved by a bypass, 
the execution unit operates to cause a temporary stall 5 
in one of the pair of pipelines 1 3 or 1 4 or in the pair 1 5 
and 16 so as to cause a temporary delay in one of the 
pipelines of the pair so as to resolve the data depend- 
ency. It will be understood that a check for a data de- 
pendency includes any form of data, including data rep- 
resenting guard values. The circuitry described in the 
following, however, reduces the likelihood of stalling on 
guard dependencies. 

[001 9] The control unit 1 2 is also arranged to provide 
a horizontal dependency check. In this specification a 
data dependency between instructions that are supplied 
to the parallel pipelines in the same machine cycle is 
referred to as a horizontal dependency. The control unit 
12 includes a program counter and address generator 
80 to provide a memory address for an instruction fetch 
operation from the program memory 11. The machine 
may operate in a selected one of a plurality of instruction 
modes including superscalar modes of variable instruc- 
tion bit length or in very long instruction word (VLIW) 
mode. The control unit 1 2 may include a process status 
register to indicate the instruction mode in which the ma- 
chine is operating. 

[0020] In use, a plurality of instructions are obtained 
from the memory 1 1 in a single fetch operation during 
one cycle and are decoded by a decoder 82 in the con- 
trol unit 12. They are checked for horizontal data de- 
pendency by dependency checking circuitry 87 to indi- 
cate if a horizontal data dependency has been located. 
After decoding, the instructions are used to generate mi- 
croinstructions for each of the execution pipelines. The 
instructions from the decoder 82 are passed to a micro- 
instruction generator 98 which generates a plurality of 
parallel microinstructions which are output by a dispatch 
circuitry 99 through parallel paths 1 00 to the four parallel 
execution pipelines 1 3, 1 4, 1 5 and 1 6 and for the general 
unit 21 . If a horizontal dependency was located, the mi- 
croinstructions on lines 100 would include an indication 
to the instruction despatch queues of the data unit 1 8 or 
address unit 19 that some action, such as a pipeline 
stall, was necessary in the execution pipelines to re- 
solve the horizontal dependency. 
[0021] In this example, each instruction may be pro- 
vided with a guard indicator G between GO and G15. If 
a guard is true, then the instruction is executed (i.e. up- 
dates the architectural state of machine); if a guard is 
false the architectural state of machine is not changed 
by instruction execution. Resolution of a guard may be 
done in different pipeline stages of the machine. 
[0022] The guard for each instruction may be selected 
between GO and G15 and in this particular example the 
guard G15 is always true. The value true or false attrib- 
uted to guards G0-G14 is however dependent upon the 
guard values held at any particular time in a guard reg- 



ister file. The master guard register file in this example 
is guard register file 27 (sometimes referred to herein 
as the architectural guard register) in the data unit 18. 
However, a supplementary or shadow guard register file 
(normally copies of the master file 27) is provided by a 
control unit guard register file 101 in the control unit 12. 
The control unit 12 also includes a register 102 to indi- 
cate which unit is currently known to be the guard owner 
in respect of each guard indicator Register 102 has a 
first bit 103 for each guard indicator which if holding the 
value 1 indicates that the address unit 19 is currently the 
guard holder for that indicator. If bit 104 for each guard 
indicator is set to the value 1 then it indicates that the 
control unit 1 2 is currently the guard owner for that guard 
indicator. If neither bit 1 03 nor 1 04 is set then the default 
condition indicates that the master guard register file 27 
must be used so that the data unit 1 8 is the guard owner. 
The guard values held in the guard register files can be 
changed by a guard modifier instruction (GMl) executed 
by the data unit 18 or the address unit 1 9. The address 
unit also has a shadow guard register file 100 which may 
be updated by guard modifier instructions executed by 
the address unit 19. Instructions executed by the data 
unit will update the master file 27 and the shadow file 
101 in the control unit. Those executed by the address 
unit will update the shadow file 100 and the master file 
27 (and hence the shadow file 101 in the control unit). 
[0023] In normal operation the guard register file 27 
in the data unit 18 maintains the architectural state of 
the guard values GO to G14 and the register file is com- 
mon to both execution pipelines 13 and 14. The opera- 
tive values of the guards are the same for all execution 
pipelines although as will be explained below, the differ- 
ent pipelines may access different register files to obtain 
the guard values. 

[0024] In this example the master register file for the 
guard values is held in the data unit 18 as it is the data 
unit that will most commonly execute instructions likely 
to change the value of the guards. Greater efficiency of 
execution cycles is therefore achieved by maintaining 
the master guard values in the register file which is di- 
rectly accessed by execution of the instructions in the 
data unit 18. When instructions are fed through either 
slot 0 or slot 1 of the data unit 18 the required guard 
value may be taken directly from the master guard reg- 
ister file 27 in accordance with the guard indicator that 
accompanied the microinstructions fed into the data unit 
18 from the control unit 12, unless the control unit 12 is 
the owner of the guard in which case the guard value 
will have been taken from the shadow registers 101 in 
the control unit 12. 

[0025] In the case of instructions to the address unit 
19, the more general position will be the default condi- 
tion in the guard owner register 1 02 indicating that guard 
ownership does not belong to the address unit 1 9 or the 
control unit 12 and consequently the guard values re- 
quired for execution of the instructions in the address 
unit 1 9 will need to be obtained from the guard register 



15 



20 



25 



30 



35 



40 



45 



50 



4 



EP 1 050 805 A1 



8 



file 27 in the data unit 18. The microinstructions fed 
through lines 100 to the execution units will include sup- 
plying a "sendguard" (sndg) instruction to the data unit 
18 as the same time as supplying the appropriate mi- 
croinstruction to the correct slot of the address unit 19. 
The "sendguard" instruction will be initially slotted into 
the instruction dispatch queue 29 of the data unit 18 in 
the same cycle of operations as the microinstruction re- 
quired lor the address unit 1 9 is slotted into the instruc- 
tion dispatch queue 41 for the address unit. All micro- 
instructions in a given execution unit are always execut- 
ed in order and all guard manipulations and transfers 
are maintained in order with respect to these micro-in- 
structions. This guarantees the synchronicity of guard 
transfers (i.e. for every guard emission from a given ex- 
ecution unit there is an opposite guard reception in an- 
other execution unit and all these are done in order. The 
control unit has responsibility to generate the correct mi- 
cro-instructions for guard transfers; the sending or re- 
ceiving execution unit only sees the send or receive (re- 
spectively) micro-instruction i.e. the action that it must 
do). In this way the correct sequencing occurs with the 
correct guard value being obtained from the guard reg- 
ister file 27 corresponding to the instruction being exe- 
cuted in the address unit 19. The supply of the "send- 
guard" instruction in such a situation is illustrated at 110 
in the drawing. As described in the following, a bypass 
mechanism for sendguard instructions is implemented 
which nevertheless respects the requirement for syn- 
chronicity of guard transfers. 

[0026] The address unit 19 has a queue of instruc- 
tions 41 awaiting dispatch to the execution units. It also 
has a queue 111 (ARLQ) of items awaiting loading into 
the pointer or control registers 40 or 22. There is also 
the queue 71 in the memory interface control 50 of store 
addresses queues awaiting a memory access as a re- 
sult ot partial execution of a store instruction in the ad- 
dress unit 1 9. When the address unit 19 awaits a guard 
transfer from the data unit 18, the instruction in the ad- 
dress unit 19 is stalled in the A-IDQ 41 or in the ARLQ 
1 1 1 or in the store address queue 71 until the requested 
guard value is transmitted from the data unit 18 through 
guard transfer circuitry 112 to the required destination. 
The transfer of the correct guard value will occur when 
the data unit 18 executes in its pipeline operation the 
"sendguard" instruction and the guard value which is 
transferred to either the address unit 19 or the data 
memory interface control 50 will need to be held in a 
queue ready for continued execution of the instruction 
once the stall is terminated. The transferred guard value 
will be held in an A-l DQ guard queue 1 1 3 if the guarded 
instruction was stalled in the IDQ 41. If the stall was in 
the ARLQ queue 111 then the transferred guard value 
will be held in an ARLQ guard queue 114. In the case 
of a store instruction where the store address had been 
added to the SAQ 71 in the interface control 50, the 
guard value will be transferred from circuitry 112 to an 
SAQ guard queue 1 1 5 in the data memory interface con- 



trol 50 so that the memory access may be implemented 
in accordance with the entry in the SAQ 71 if the guard 
value transferred permits this. It will be seen that by this 
provision, the address unit can execute a memory store 

5 instruction as far as identifying the required store ad- 
dress and adding that address to a queue in the inter- 
face 50 prior to checking whether or not the guard value 
of the store instruction is true or false. The store opera- 
tion will be held in the queue 71 until the guard value is 

10 checked and will only proceed to completion if the guard 
value is true. In each case where the guard value is 
transferred to the address unit 1 9 from the data unit 1 8, 
the stalled instruction forthe address unit or general unit 
21 will be resumed or rendered inoperative depending 

is on the guard value transferred from the data unit file 27. 
[0027] The use of the guard queues 113, 11 4 and 115 
allow resynchronisation of the guard values with the mi- 
croinstruction that caused the request "sendguard" 110 
to be sent to the data unit 18. The above description for 

20 operation of a guarded store instruction indicated how 
the effective store address could be put on the queue 
71 prior to resolving the guard value. The address unit 
1 9 may be operated with an earlier stall in the execution 
of a store instruction so that the effective address is not 

25 calculated and fed to the interface control 50 until after 
the guard value has been transferred and resolved. Sim- 
ilarly a guarded load instruction may be executed by the 
address unit 19 to access the memory and obtain the 
required data for addition to the load data queue 60 prior 

30 to resolving the guard value. Alternatively the address 
unit may cause an earlier stall awaiting resolution of the 
guard value transferred from the data unit prior to ob- 
taining the data from the memory and putting it into the 
queue 60. In the case where the data is obtained from 

35 the memory and put onto the load data queue 60 prior 
to resolution of the guard value : the appropriate register 
file 26, 40 or 22 is updated by a load operation from the 
load data queue 60 only if the guard value is found to 
be true. In the case of a false guard value, the register 

<o files are not updated and the appropriate execution unit 
effects a read of the load data queue 60 to remove the 
unwanted data from the queue without updating any 
destination register file. 

[0028] As indicated above, the guard values to be as- 
<5 signed to the guard indicators G0-G1 4 can be modified 
by executing a guard modifier instruction. These can be 
executed by either the data unit 18 or the address unit 
19. In the case of a guard modifier instruction executed 
by the data unit 18, the new guard value is calculated in 
50 the execution pipeline of the appropriate slot 0 or slot 1 
of the data unit 1 8 and, whichever of these pipelines has 
been used, it will cause a new value of the guard to be 
written into the guard register file 27 at the end of the 
execution pipeline in order to update the register file 27. 
55 The guard modifier instructions may themselves be 
guarded. 

[0029] In the case of the guard modifier instruction 
which is supplied to the address unit 19, the new value 
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will be calculated on execution of the instruction in either 
slot 0 or slot 1 pipeline of the address unit 19 and the 
new value can be written into the address guard register 
file 100 of the address unit 1 9 at the end of the execution 
pipeline. However it is necessary for the new guard val- 
ue to be transferred to the master guard register file 27 
of the data unit 18 in order to update that master file at 
the correct cycle of operation. To achieve this, the con- 
trol unit 12 arranges to supply a "receive guard" instruc- 
tion to the data unit 18 simultaneously with the supply 
of any microinstruction to the address unit 1 9 which will 
cause a new guard value to be calculated by the address 
unit 19. This "receive guard" instruction is illustrated at 

116 in the drawing and is added to the instruction dis- 
patch queue 29 of the data unit 18 in the same cycle as 
the microinstruction generating a new guard value is 
supplied to the address unit 19. As soon as the execu- 
tion pipeline in the address unit 19 has calculated the 
new guard value it is supplied to a transfer FIFO 117 1o 
enable supply of the new guard value to the data unit 

18. The "receive guard" instruction supplied to the data 
unit 1 8 is treated by the data unit 1 8 in the same way as 
a guard modifier instruction except that it does not need 
to calculate the new guard value but merely obtains it 
from the FIFO 117. 

[0030] The microinstruction generating the guard val- 
ue is supplied to the IDQ41 of the address unit 19 in the 
same cycle as the "receive guard" microinstruction is fed 
to the IDQ 29 of the data unit 18. As explained above, 
synchronisation is achieved between the two units so 
that the new guard value transferred through the FIFO 

117 updates the master register file 27 at the correct 
point in the sequence of the instruction execution. 
[0031] It is possible to execute a load GR instruction 
in order to load from memory 20 into the guard register 
files 27 and 100 of the data unit 18 in the address unit 

1 9. This instruction is treated as a load data register and 
a load pointer register instruction and the correct value 
is obtained from memory and put into the load data 
queues 60. The transfer from the load data queues 60 
to the address unit 1 9 and data unit 1 8 occurs independ- 
ently and the data is maintained in the load data queue 
60 until both units have read the appropriate data. The 
microinstructions in the instruction dispatch queues 29 
and 41 necessary to load the guard registers 27 and 1 00 
are stalled until the execution units of both the data unit 
18 and the address unit 19 is empty of any guard mod- 
ifier instructions. At that time the load guard register in- 
structions can proceed to transfer the required value 
from the load data queue 60 into the guard register file 
without conflict with any other guard modifier instruc- 
tions. 

[0032] It is apparent that when the data unit 18 re- 
ceives a sendguard microinstruction, the correct guard 
value should be selected and passed to the requesting 
execution unit as soon as possible to avoid unnecessary 
delays. A mechanism is described in the following which 
achieves this without unnecessary stalls in the data unit 



18. 

[0033] The instruction data queue 29 of the data unit 
is implemented as a FIFO having a plurality of storage 
locations, each of which holds two microinstructions 

5 (slot 0 and slot 1 ). Reference is made to Figure 2 which 
illustrates the main instruction FIFO 29 as having four 
storage locations L0 to L3, although in fact more than 
four would normally be provided. To implement the 
sendguard mechanism, a separate sendguard queue 

io implemented as a FIFO 200 is provided for holding the 
sendguard (sndg) fields of any sendguard microinstruc- 
tions written to the IDQ 29, each sndg field being asso- 
ciated with a sendguard tag sndg_tag. A write event 
counter 202 is incremented by 2 each time two new mi- 
's cro in st ructions are written to the main instruction FIFO 
29. It includes an overflow bit wr_ovfl to take into ac- 
count circular pointer effects. A read-event counter 204 
counts the number of microinstructions read into the da- 
ta unit pipelines 13,14. This may be two in each cycle 

20 or, where a horizontal slot dependency has been re- 
solved, only one (i.e. a split sequence). 
[0034] A gmi_tag register 206 holds a gmijag for 
each guard indicator. As already mentioned, there are 
fifteen guard indicators GO to G14 with assigned guard 

25 values although only three such tags are shown in Fig- 
ure 2 labelled guard0_tag,guard1_tag etc. Each tag has 
six bits. Four bits represent a write count which is a value 
read from the write counter 202 (plus 0 for slot 0, plus 1 
for slot 1 ) whenever a guard modifier instruction aff ect- 

30 ing that guard indicator is written to the FIFO 29. One 
bit is an aux_gmi_tag bit which is the overflow bit wr_ovfl 
from the write counter 202. The remaining bit is an active 
gmi bit act_gmi which indicates whether there is a valid 
guard modifier instruction (GMI) stored in the main in- 

35 struction FIFO 29 affecting that guard indicator. This bit 
is set to one if a new GMI affecting that guard indicator 
is written to the main instruction FIFO 29 where there 
was not one before or if there remain any active GMIs 
affecting that guard indicator in the main instruction 

40 FIFO 29. 

[0035] Each time a sndg instruction is written to the 
sndg queue 200, the gmi_tag value for the guard indi- 
cator referred to in the sndg instruction is written to the 
sndg_tag associated with the sndg field is the sndg 

45 queue 200, Thus this indicates the symbolic slot position 
of the youngest GMI in the main instruction FIFO 29 
modifying the requested guard indicator. 
[0036] The purpose of the separate sndg queue is to 
provide a bypass feature into the data unit pipelines 

so 1 3, 1 4 so that as soon as a sndg instruction arrives it can 
be supplied to one of the pipelines 13,14 as soon as 
possible without unnecessary delays. However, it is im- 
portant that it is not supplied to a pipeline in a cycle ear- 
lier than a GMI affecting that guard indicator which was 

55 expected to be executed earlier according to the instruc- 
tion sequence in the original program. To make sure that 
this does not happen, a bypass controller 208 compares 
the read event count with the sndg tag at the head of 
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the sndg queue 200 and the sndg instruction is supplied 
to the pipeline only it read-event counter > sndg-tag. 
Otherwise the next instruction in the FIFO 29 is fetched 
into the execution pipeline 13 or 14. 
[0037] A mechanism will now be described with refer- 
ence to Figure 3 which avoids stalls of sendguard in- 
structions which enter the data unit pipeline 1 3 or 1 4. In 
order to achieve this, it is necessary to know the depend- 
ency between a sendguard instruction despatched into 
the data unit pipeline 13 or 14 and any guard modifier 
instructions affecting the value of the guard to be sent. 
Any such dependencies are resolved by the vertical de- 
pendency checker 30 for the data unit pipelines. Signals 
from the vertical dependency checker are despatched 
into the data unit pipelines to control multiplexors de- 
scribed hereinafter to avoid stalling regardless of guard 
dependencies. 

[0038] The vertical dependency checker 30 uses in- 
formation provided in the fields of the microinstructions 
supplied to the data unit. Figure 4 illustrates the relevant 
fields of a microinstruction. It will be appreciated that the 
microinstruction contains a number of other fields, but 
these are not included herein because they do not affect 
the dependency resolution. For guard modifier instruc- 
tions, the relevant fields are dg-v which is equal to one 
when a guard is to be modified, and dg-guard which 
comprises four bits identifying which of the sixteen 
guard indicators is to be modified. 
[0039] The fields relevant to sendguard instructions 
are sndg-v which is equal to one when a. guard is to be 
sent and sndg-guard which identifies which guard indi- 
cator value is to be sent. 

[0040] As has briefly been explained above, each of 
the data unit pipelines comprises a data operand fetch 
stage 210, a first execution stage EX1 212, a second 
execution stage EX2 214 and a data writeback stage 
D WB 216. Instructions and data are propagated th rough 
the pipeline on successive machine cycles, such that 
each pipeline stage represents operations in each cycle. 
Figure 3 does not illustrate the components ol the pipe- 
line which are necessary for the normal execution of in- 
structions, nor does it show the interaction of the pipe- 
line with the data register file 26. The required fields of 
the microinstructions are used at each appropriate 
stage in the pipeline and in particular the GMI and send- 
guard fields. The GMI fields dg-v and dg-guard are re- 
ceived in the DOF stage and are propagated through 
the subsequent first and second execution stages EX1 
and EX2 as the instruction is executed. As a GMI in- 
struction might itself be guarded, if it occurs that the 
guard indicator on which the GMI instruction is guarded 
is false, the field dg-v is unset and continues to propa- 
gate in the pipe with the unset value of zero. The de- 
pendencies between the sendguard instructions to be 
executed and any GMI instructions currently in the pipe 
can be resolved by using simple comparators with re- 
spect to the GMI fields and the sendguard fields. For 
example for a sendguard instruction in the DOF stage 



of the pipeline, a comparison is made between its cor- 
responding sendguard fields sndg-v and sndg-guard 
and the information available in subsequent stages of 
the pipeline EX1 , EX2 relating to a possible GMI instruc- 

s tion ahead in the pipeline. For example, if a guard mod- 
ifier instruction affecting the guard indicator GO is in the 
first execution stage EX1 , and the sendguard instruction 
referring to GO is in the data operand fetch stage, a spe- 
cific signal is set indicating that such a dependency ex- 

10 ists. These signals are set by the vertical dependency 
checking unit 30 and are used to control multiplexors in 
the pipeline as described later. 
[0041] Circuitry tor executing sndg instructions is 
shown in dotted outline referenced 219 in DOF stage 

is 210 and 221 in EX1 stage 212. These sendguard exe- 
cution units 219,212 receive sndg opcode from the 
sendguard instruction. Guard indicators for addressing 
the guard register file 27 are also supplied with the send- 
guard instruction. It will be appreciated that the pipeline 

20 includes other units, omitted for the sake of clarity. Fig- 
ure 3 is merely a diagrammatic representation of the 
pipeline stages, and additionally illustrates circuitry pro- 
viding a mechanism for avoiding stalls of sendguard in- 
structions. The circuitry comprises a first selection mui- 

25 tiplexor 2 1 8 located in the data operand fetch stage 21 0 ; 
a second selection multiplexor 220 located in the first 
execution stage 21 2 and a guard selection multiplexor 
222 which can propagate the correct guard value in ei- 
ther the data operand fetch stage or the first execution 

30 stage as described in the following. A computation unit 
224 allows guard values to be computed responsive to 
guard modifier instructions in the second execution 
stage EX2 214. The output of the computation unit 224 
is supplied to the guard register file 27 via a guard value 

35 output multiplexor 226. The guard register file 27 is ad- 
dressable via the guard indicators in a manner not 
shown in Figure 3 such that the computed guard value 
is written against the correct guard indicator. In each cy- 
cle, for each guard indicator either the existing value is 

40 written back into the guard register file via the feedback 
loop 228, or the newly modified guard value is written to 
the guard register file for the particular guard indicator. 
The mechanism operates as follows. 
[0042] When there is no guard dependency, that is 

45 when there are no earlier instructions in the pipeline 
which imply the modification of the guard value which 
has been requested to be sent by the sendguard instruc- 
tion which has just entered the pipeline, the first selec- 
tion multiplexor 218 is controlled to supply the architec- 

50 tural guard value from the guard register file 27 and the 
guard selection multiplexor 222 outputs this guard value 
in the data operand fetch stage of the pipeline. Thus, 
only a single cycle is required to deal with the sendguard 
instruction and to despatch the correct guard value. 

55 [0043] When there is an earlier instruction in the pipe- 
line located two cycles ahead which modifies the guard 
value requested to be sent by the sendguard instruction, 
this guard value is computed in the second execution 
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stage of the pipeline as the sendguard instruction is re- 
ceived in the data operand fetch stage. The just com- 
puted guard value is supplied to the first selection mul- 
tiplexor 218 and is selected through to the guard selec- 
tion multiplexor 222. Thus, this also allows the correct 
guard value to be despatched in the first cycle of exe- 
cution of the sendguard instruction. 
[0044] The mechanism is slightly more complicated 
in the case where a guard modifying instruction is locat- 
ed one cycle ahead of the sendguard instruction and is 
modifying the requested guard value. In this case, the 
sendguard instruction is not dealt with in the data oper- 
and fetch stage, but is propagated though to the first 
execution stage EX1. In the same machine cycle, the 
guard modifying instruction ahead of it is computed by 
the computation unit 224 in the second execution stage 
EX2 and the just computed value is returned to the sec- 
ond selection multiplexor 220. In this case therefore, the 
guard selection multiplexor 222 despatches the output 
of the second selection multiplexor 220 rather than the 
output of the first selection multiplexor 218 thereby al- 
lowing the sendguard instruction to be dealt with in two 
cycles. Although this is slower than a single cycle 
achieved in the first two cases, the performance is still 
high because no stalls in the pipeline have occurred. 
[0045] By including a guard value store 230 at the out- 
put of the data operand fetch stage, it is also possible 
to deal with two sendguard instructions entering the 
pipeline one after the other immediately following a 
guard modifier instruction. The first sendguard instruc- 
tion which has the guard modifier instruction just in front 
of it by one cycle is dealt with as just described. That is, 
it is propagated through to the first execution stage EX1 
so that the guard value can be computed in the compu- 
tation unit 224 in the second execution stage EX2. The 
just computed guard value is then despatched as al- 
ready described. While this is happening it is however 
possible to deal with the second sendguard instruction 
t in the data operand fetch stage. If the sendguard instruc- 
tion relates to a guard value not affected by the guard 
modifier instruction, it can be dealt with in the normal 
way and the value selected from the architectural guard 
register 27 held in the store 230 ready for despatch on 
the subsequent cycle after the just computed guard val- 
ue has been despatched. If the second sendguard in- 
struction relates to the same guard indicator which has 
been modified by the earlier guard modifier instruction, 
the value which is held in the store 230 is the just com- 
puted value which is returned back to the first selection 
multiplexor 218 in the data operand fetch stage. This 
allows maximum use of the pipeline stages and avoids 
stalls of the sendguard instructions whatever are the de- 
pendencies. 



Claims 

1. A computer system for executing instructions hav- 



ing assigned guard indicators, which system com- 
prises instruction supply circuitry, a plurality of par- 
allel execution units for receiving respective instruc- 
tions from the supply circuitry, each instruction hav- 

5 ing a respective guard indicator selected from a set 
of guard indicators, common to the plurality of exe- 
cution units, one of said execution units including a 
master guard value store containing a master rep- 
resentation of current values for the guard indica- 

10 tors in said set of guard indicators and guard value 
transfer circuitry operable to transfer a guard value 
from said master store to another of said execution 
units in response to a sendguard instruction being 
executed in said one execution unit, wherein the in- 

is struction supply circuitry comprises a main instruc- 
tion queue for holding instructions to be supplied to 
the parallel execution units and a subsidiary instruc- 
tion queue for holding sendguard instructions, with 
the subsidiary queue having priority access to the 

20 execution pipelines to avoid unnecessary delays for 
execution of sendguard instructions. 

2. A computer system according to claim 1 in which 
said execution unit includes circuitry for executing 
25 guard value modifying instructions which are held 
k in the main instruction queue and for updating said 
master guard value store with any modified guard 
value. 

30 3. A computer system according to claim 2 in which 
the instruction supply circuitry comprises a control 
unit common to each parallel execution unit, said 
control unit including queue checking circuitry 
which is operable to check, before each sendguard 

35 instruction is supplied to the execution pipeline, that 
no earlier guard value modifying instructions affect- 
ing the guard value requested by the sendguard in- 
struction is still waiting in the main instruction 
queue. 

40 

4. A computer system according to any one of the pre- 
ceding claims in which each execution unit compris- 
es a plurality of pipelined stages with synchronised 
pipelined cycles for each ol the execution units. 

45 

5. A computer system according to claims 2 and 4, 
wherein the circuitry for executing guard value mod- 
ifying instructions is located in a pipelined stage 
downstream of an earlier pipelined stage operable 

50 to execute sendguard instructions. 

6. A computer system according to claim 5, compris- 
ing switching circuitry in the earlier pipelined stage 
for supplying a guard value which has been modi- 

55 fjed by a guard value modifying instruction in the 
subsequent pipelined stage in a machine cycle re- 
sponsive to a sendguard instruction in the earlier 
pipelined stage in the same machine cycle. 
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7. A computer system according to claim 1 , in which 
each execution unit comprises first, second and 
third pipelined stages wherein the first and second 
pipelined stages contain circuitry for executing 
sendguard instructions, and the third pipelined 
stage includes circuitry for executing guard value 
modifying instructions. 

8. A computer system according to claim 7, which 
comprises switching circuitry for selecting, respon- 
sive to a sendguard instruction, whether a guard 
value held in the master guard value store or a 
guard value just modified by a guard value modify- 
ing instruction is despatched via the guard value 
transfer circuitry. 

9. A computer system according to any one of the pre- 
ceding claims in which said execution unit include 
execution pipelines providing access to a data 
memory, said pipelines including a first set of pipe- 
lines for use in executing instructions needed for 
memory access operations and a second set of 
pipelines arranged to carry out arithmetic opera- 
tions, thereby providing decoupling of memory ac- 
cess operations from arithmetic operations. 

10. A computer system according to claim 9 in which 
two parallel pipelines are provided for arithmetic op- 
erations and access a common set of data registers 
including said master guard value store. 

11 . A computer system according to any one of the pre- 
ceding claims in which said master guard value 
store comprises a register file. 

12. A method of executing instructions in a computer 
system, said instructions having assigned guard in- 
dicators, which method comprises supplying a plu- 
rality of instructions to parallel execution units, each 
instruction having a respective guard indicator se- 
lected from a set of guard indicators common to the 
plurality of execution units, holding a master set of 
current values for the guard indicators in one exe- 
cution unit, effecting a transfer of the master guard 
value from said one execution unit in response to a 
sendguard instruction being executed in said one 
execution unit wherein instructions to be supplied 
to the parallel execution unit, save for said send- 
guard instructions, are held in a main instruction 
queue and wherein said sendguard instructions are 
held in a subsidiary queue having priority access to 
the execution pipelines to avoid unnecessary de- 
lays for execution of sendguard instructions. 

13. A method according to claim 12, including execut- 
ing instructions to modify said guard values wherein 
execution of an instruction to modify a guard value 
updates said master value, said guard value modi- 



fying instructions being held in the main instruction 
queue. 

14. A method according to claim 13. which comprises 
s the step of checking, before each sendguard in- 
struction is supplied to the execution pipeline, that 
no earlier guard value modifying instruction affect- 
ing the guard value requested by the sendguard in- 
struction is still waiting in the main instruction 

10 queue. 

15. A pipelined execution unit for a computer system 
which comprises at least two pipelined stages, with 
an earlier one of the pipelined stages including 

*5 sendguard circuitry responsive to a sendguard in- 
struction to despatch a guard value for a guard in- 
dicator defined in the sendguard instruction, and a 
later one of the pipelined stages including guard val- 
ue modifying circuitry for executing guard value 

20 modifying instructions which cause the value of a 
guard indicator to be modified, the pipelined execu- 
tion unit further comprising: 

a master guard value store for holding a master 
25 representation of current values for guard indi- 

cators; and 

means for determining any dependencies be- 
tween a sendguard instruction in the earlier 
pipelined stage and a guard modifying instruc- 

30 tion in the later pipelined stage; and 

switching circuitry for selecting when a send 
guard instruction is executed and responsive to 
any such dependencies, whether a guard value 
held in the master guard value store or a guard 

35 value just modified by the guard value modify- 

ing circuitry is to be despatched. 

16. A pipelined execution unit according to claim 15, 
which comprises first, second and third pipelined 

40 stages wherein the first and second pipelined stag- 
es each contain such circuitry for executing send- 
guard instructions, and the third pipelined stage in- 
cludes such circuitry for executing guard value 
modifying instructions. 

45 

17. A pipelined execution unit according to claim 15 or 
16, which includes a data write-back stage for writ- 
ing back the results of execution of an instruction 
into a data store. 

so 

1 8. A method of executing instructions in a pipelined ex- 
ecution unit, each instruction having a respective 
guard indicator selected from a set of guard indica- 
tors and execution of the instructions being predi- 

55 cated on the values of the guard indicators, wherein 
the instructions include a sendguard instruction 
which, when executed, causes the transfer of a 
guard value from the pipelined execution unit and a 
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guard value modifying instruction which modifies . 
the value of a guard indicator, the method compris- 
ing: 

supplying instructions to the pipelined execu- 5 
tion unit including said sendguard instructions 
and said guard value modifying instructions; 
checking dependencies between guard value 
modifying instructions supplied to the pipelined 
execution unit earlier than a sendguard instruc- io 
tion relating to the same guard indicator; and 
supplying the guard value ot the guard indicator 
requested in the sendguard instruction selec- 
tively from a master guard value store or guard 
value modifying circuitry in dependence on the 
results of said dependency checks, so as to en- 
sure that the guard value of a guard indicator 
which is despatched responsive to a sendguard 
instruction is correct in relation to any earlier 
guard value modifying instructions in the pipe- 20 
lined execution unit. 
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